Information mining: use of the document dimensions to analyse interactively a document set
Authors
Abstract
This paper introduces a new interface that integrates document mining. A key point is the co-operation of different modules with which the user interacts in order to visualise a document set globally. The visualisation is produced from different points of view, each given by one of the document dimensions. The graphical visualisations are animated and four-dimensional. The overall mining process follows the framework of a Knowledge Discovery in Databases (KDD) process. We present the phases involved in document mining and show how the technique can be applied to a document set.
Similar resources
A Joint Semantic Vector Representation Model for Text Clustering and Classification
Text clustering and classification are two main tasks of text mining. Feature selection plays a key role in the quality of the clustering and classification results. Although word-based features such as term frequency-inverse document frequency (TF-IDF) vectors have been widely used in different applications, their shortcoming in capturing semantic concepts of text motivated researchers to use...
A survey on Automatic Text Summarization
Text summarization endeavors to produce a summary version of a text, while maintaining the original ideas. The textual content on the web, in particular, is growing at an exponential rate. The ability to decipher through such massive amount of data, in order to extract the useful information, is a major undertaking and requires an automatic mechanism to aid with the extant repository of informa...
Document Analysis And Classification Based On Passing Window
In this paper we present a document analysis and classification system to segment and classify the contents of Arabic document images. The system includes preprocessing, document segmentation, feature extraction and document classification. In preprocessing, a document image is enhanced by removing noise, binarization, and detecting and correcting image skew. In document segmentation, an algorith...
Examination of Vroom’s motivational theory: A new marketing strategy in consumers of online document delivery services: Case study of Shahid Chamran University of Ahvaz
This study aimed to identify and test expectancy motivational model as a theoretical framework to explain the reasons motivating expected information consumer’s behavior for the selection and use of document delivery services of Shahid Chamran University, Ahvaz. In this study, explanatory survey method was used. In order to test the hypotheses and analysis of model’s data, covariance structural...
A New Document Embedding Method for News Classification
Text classification is one of the main tasks of natural language processing (NLP). In this task, documents are classified into pre-defined categories. A large amount of news spreads on the web. A text classifier can categorize news automatically, which facilitates and accelerates access to the news. The first step in text classification is to represent documents in a suitable way t...
Publication date: 2001